A Portuguese spoken and multi
Authors
Abstract
This paper presents an overview of the Portuguese spoken and multi-modal dialogue corpora collected in the context of the FASiL (Flexible and Adaptive Spoken Language and Multi-Modal Interfaces) project. The project developed a Virtual Personal Assistant application in the Personal Information Management domain, exploiting the state of the art in speech and multi-modal technology. The FASiL corpora were collected to obtain acoustic and language modelling data for the spoken language components, to study adaptive dialogue management techniques, and to support multi-modal interface research tasks. Because the availability of spoken and multi-modal resources is key to the research, development, and acceptability of new multi-modal services and applications, the corpora will be available to interested parties through ELRA.
Similar resources
Prosody, syntax, and pragmatics: insubordination in spoken Brazilian Portuguese
In this paper, we approach the phenomenon of insubordination in spoken Brazilian Portuguese through data on adverbial clauses extracted from the C-ORAL-BRASIL corpus [Raso and Mello 2012]. In contrast to the traditional conception of insubordination [Evans 2007], we propose a synchronic view, strongly based on the analysis of the prosody/pragmatics interface in spoken language. We show that i...
A Rule Based Pronunciation Generator and Regional Accent Databank for Portuguese
One of the major obstacles in deploying spoken language technologies (SLTs) in the developing world is a lack of key linguistic resources – e.g. electronic dictionaries, phonetically aligned corpora, pronunciation lexicons, etc. – that describe the non-dominant varieties spoken in such countries and regions. In this paper, we describe the work of the LUPo (Portuguese Unisyn Lexicon) project to ...
A BLARK extension for temporal annotation mining
The Basic Language Resource Kit (BLARK) proposed by Krauwer is designed for the creation of initial textual resources. There are a number of toolkits for the development of spoken language resources and systems, but tools for second level resources, that is, resources which are the result of processing primary level speech resources such as speech recordings. Typically, processing of this kind ...
Variation and Diffusion of Uruguayan Portuguese in a Bilingual Border Town
In Northern Uruguay, along the Brazilian border, Portuguese was the only language spoken by the Portuguese and the Brazilian settlers. It was only at the end of the last century that Spanish was imposed upon the Portuguese speaking communities through state educational policies and language planning. Spanish was then brought to the border areas as a prestigious language, and rural Portuguese wa...
Spoken Portuguese: Geographic and Social Varieties
The main goal of the Spoken Portuguese: Geographic and Social Varieties project is the teaching of Portuguese as a foreign language. The idea is to provide a collection of authentic spoken texts and to make it user-friendly. Therefore, a selection of spontaneous oral data was made, using either already compiled material or material recorded for this purpose. The final corpus constitution resulted...